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(54) Method and apparatus for logging web user activity using client-side support 



(57) Methods and apparatus tor traokngcientacfcr- 
»y associated with a browser running on a cient device 
in conjunction with a server, wherein the client device 
and the server are in a computer network, are provided. 
In an fllustrative embocfiment a rrielhod employs dien*- 
side executable content that is dynamically downloaded 
into the browser, such as an applet The cfient-stte exe- 
cutable content or applet is referred to as an "agent " 
The agent is preferably downloaded into the browser 
along with a document requested by the dent and 
returned by the server. Once estabfished h the browser, 
the agent keeps track of user interaction that occurs in 
the browser, that is, client-side interaction. The granu- 
larity of this tracking can be regulated, allowing the 
agent in the browser to collect as much information as is 
avertable, or some subset thereof. The agent then peri- 
odically notifies a server-side entity referred to as a Tog- 
ger" of the information that it has collected Trie logger 
stores the information in a log database for subsequent 
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Description 

Field of the Invention 

[0001] The present invention relates to methods 
and apparatus for logging user activity on a computer- 
based information network and, more particularly, to 
unfied logging of web user activity using client-side 
support 

Background of tfafl invention 
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100021 A browser is a software program run on a 
computer that allows a computer user to request, view, 
manipulate and/or send information across a computer is 
network. For example, a web-based browser serves as 
a user** front-end interface to the World Wide Web 
(WWW) on the Internet As is known, using a web- 
based browser, a user types in or selects the adoYess or 
URL (Uniform Resource Locator) of a website to 20 
revest a webpage (ag., homepage) associated with 
the website that he wishes to visit The browser trans- 
mits the revest over the Irtemet to trie web server with 
which the website is associated and the server down- 
loads the homepage to the browser tor viewing by the 25 
user. The homepage is typically an index to other pages 
on that website that can be accessed by cScJdng on 
joorKwhic^ 

1000?! Typicafy. at the web server, the incoming 
URL request from the web browser is logged (Le^ ao 
stored). The web server then performs c omputations 
appropriate to fuffifing the request generates a 
response, and transmits or downloads this information 
to the web browser. A fog entry is generated when the 
readiest is received and typically contains a record off as 
attributes associated with the request such as, for 
example, when the request was received, who initiated 
the request the name of the requested object ete. 
[0004] However, such a con v entional web server 
logging technique has many shortcomings. For exam- 40 
pie, information captured by a web server log is only 
generic web server information, that is, trie -who, what 
and when - associated with the request Thus, every 
common gate interface (CGI) program running on the 
web server or, for that matter, any executable entity that 45 
oynarnicalfy generates content has to do as own tog- 
ging. Further, the manner in which the program or other 
executable entity accomplishes the logging is depend- 
ent on the web server. Also, web server fogs only cap- 
ture information at the time a request is subrrvtted to the so 
server. Therefore, existing web server togs do not cap- 
ture cfient-side (La, end user) activity. As a result off 
such shortcomings, web server fogging is dfejointed in 
that separate fogs are created for various programs. 
Furthermore, the web server and, more particularly, the ss 
party operating the web server, does not receive any 
information regarding how end users interact with the 
information the web server provides. 



Summary off the invention 

[00061 The present invention provides methods and 
apparatus for permitting unified logging and cfient-side 
activity logging. As wSi be evident such fogging is uni- 
fied in that the techniques of the invention are not 
dependent on the server, CGI programs, or other exe- 
cutable content associated with the server. Also, infor- 
mation relating to client-side activity is advantageously 
logged at the server throughout a user's interaction witfi 
a downloaded ctocument in adcfitton to any generic web 
server information fogged at the time of a ctocument 



[00061 tn en illustrative embafment the invention 
accomplishes these and other objects and advantages 
by employing cfient-side executable content that is 
dynamkafly downloaded into the browser, such as an 
applet As is known, an applet is a fimfted-function. un- 
ity program. The cfient-side executable content or 
applet is referred to as an "agent" The agent is prefera- 
bly downloaded into a browser running on the client 
device along with a document requested by the cfient 
and returned by the server. Once established In the 
browser, the agent keeps track off user interaction that 
occurs in the browser, that is, client-side interaction. 
The granularity of this tracking can be regulated, allow- 
ing the agent in the browser to collect as much informa- 
tion as is available, or some subset thereof. The agent 
then perkxficalry notifies a server-side entity referred to 
as a logger of the information that ft has a 
logger stores the irTforrnation in a log database tor sub- 
sequent use. Advantageously, the present invention 
may be implemented with unmodified standard web 
browse rs that support executable content 
[0007] These and other objects, features and 
advantages of the present invention wffl become appar- 
ent from trie Wtowing detailed description of fflustrative 
errfcodlments thereof, which is to be read in connection 
wfth the accxxnpanytng drawings. 

Prief Description off the Drawings 

BXXJ8] 

F9GL 1 is a block diagram illustrating an exemplary 
embodirnent of a web user activity toogjng arrange- 
ment according to the present invention; 

RG. 2 is a block otagram illustrating a cfient-server 
system in which web user activity togging acoorrfing 
to the invention may be implemente d ; 

FK3. 3 is a flow chart frustrating a web user activity 
fogging method according to an exemplary ernbod- 
iment of the present invention; and 

FKx 4 is a flow chart illustrating an agent data col- 
lecting method according to an exemplary embodi- 
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merit of the present invention 
Pete lied Description of Preferred Embodiments 

[0009] The inventkxiwffl be described 
junction with an exemplary web-based browser and web 
server arrangement that is compatible with standard 
web-based browsers such as, for example, the NET- 
SCAPE browser (available from Netscape Communica- 
tions Corp oration ), the standard hypertext transfer 
protocol (HTTP), and hypertext markup language 
(HTML). AHhoupJi particularly weft-suited for implemen- 
tation ewer the Internet using Transmission Control Pro- 
tocoWntemet Protocol (TCP/IP) connections, the 
inventkxi is not Gmfted to use with any particular type of 
network or network communication protocol. The en- 
closed techniques are suitable for use with a wide vari- 
ety of other networks and protocols, and many different 
types erf brtawsers installed on cferrt computers. Further, 
the term "processor as used herein is intended to 
include any processing device, including a CPU (central 
processing unit), which may be utilized in a cfent com- 
puter, server computer, or any other computer 
employed in conjunction with tie invention. Also, the 
term "memory- as used herein is intended to include 
memory associated with a processor or CPU, such as, 
for example, RAM, ROM. a fixed memory device (e.g., 
hard drive), or a removable memory device (ag., dis- 
kette). In adoption, the p roce ssi ng device may include 
one or more input devices, e-g„ keyboard, for inputting 
data to the processing unit as wefl as one or more out- 
put devices, e.g., CRT display andAor printer, for provid- 
ing results associated with the processing unrt Further, 
it is to be understood that one or more processing 
devices within the network may share associated 
resources. Accordingly, software instructions or code tor 
performing the methodologies off the invention, 
described herein, may be stored in one or more off the 
associated memory devices (ROM, fixed or removable 
memory) and. when ready to be utilized, loaded into - 
RAM and executed by a CPUL 

[00101 Referring to FIG. 1, a web user activity log- 
ging system 100 accor di ng to an exemplary embodi- 
ment of the invention is shown. The system 100 
includes a browser 102 having an agent 104 associated 
therewith. As previously explained, the browser 102 is a 
computer software program that runs on a computer of 
the system 100 and allows a user to, for example, place 
information requests to various websites associated 
with network servers. The agent 104 is also a computer 
software program that is run in conjunction with the 
browser 102 on a computer of the system 100 and, as 
wil be explained, provides the necessary functionality 
to perrnit collection of information pertaining to web user 
activity according to the inventioa ft is to be appreciated 
that the agent 104 is executable content, preferably in 
the form of an applet that is dynamically downloaded 
into a client, preferably at the time when the client 
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receives information, e.g., HTML documents, from a 
web server in response to a URL request The applet is 
preferably mobile code downloaded by the web server 
to the client pursuant to a mobile programming lan- 
guage such as, for example, the standard JAVA lan- 
guage. Thus, in this instance, the mobile code may be in 
the form of a JAVA applet. Agent 104 is reafized when 
the applet starts to run within browser 102. 
[0011] Further, the web user activity logging system 
1 00 of the invention includes a web or HTTP server 1 06. 
The web server is coupled to the web browser 102 via a 
commurocation channel 10a ft is to be appreciated that 
FIG. 1 includes only one web server and one cfient web 
browser merely to illustrate the concepts of the inven- 
tion, however, there may be any number of web servers 
and cfient web browsers with agents operating accord- 
ing to the invention. The web server 106 also includes a 
logger 110 and a fog database 112 for storing informa- 
tion, as wil be explained below, in accordance with the 
invention. While the logger 110 and the log database 
112 are shown external to the web server 106 in FIG. 1. 
it is to be appreciated that these components preferably 
reside in the server 106. That is. the logger 110 is a 
computer software program that preferably runs on the 
server 106 and. as wffl be explained, provides the nec- 
essary functionality to permit web user activity logging 
at a web server in conjunction with an agent residing in 
a clients web browser according to the invention. Also, 
the log database 112 is preferably resident on the 
server 106 in the form of memory associated therewith. 
However, ft is to be noted tfiat the logger 1 10 and/or tog 
database 112 may be run on one or more computers 
separate from the web server 106. In which case, the 
computer or computers on which the logger and tog 
database reside may be coupled to the web server 106 
via one or more communication ch a nnels. 
[0012] Acfvantegeousty, when a user makes a 
request to a web server, for example, to retrieve infor- 
mation, an agent is downloaded into the users web 
browser in the form of executable content While the 
agent is preferably downloaded along with the docu- 
ment requested by the cfient. it is to be appreciated that 
the agent may be established in the web browser, upon 
a users request, prior to any request for information. 
However, once in the web browser, the agent keeps 
track of, i.e., collects and stores, all user interaction that 
occurs in the browser, that is, cfient-eide interaction. For 
example, types of user Interaction that may be tracked 
include information concerning what portion of a 
retrieved document (e.g., images, form elements, etc.) a 
user has viewed, what data has been fflled in on a form 
element modifications to that data, what sequence the 
data was filed in. etc. Also, information concerning the 
users page to page navigation through retrieved docu- 
ments, which may be stored in a browser cache, may be 
tracked and provided to the server. Of course, the above 
examples are merely illustrative and, as such, the type 
of user activity tracked is not limited thereto. Also, as 
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mentioned, the granularity of this tracking can be regu- 
lated wflh the use of i n forma ti on filters associated with 
the agent, as will be explained, allowing the agent to 
collect as much information as is available, or some 
subset thereof. The agent then periodicafly notifies the 
server-side entity, La, the logger, of ail the information 
that it has collected. The logger processes and stores 
the information for subsequent use. 
[0013] FIG. 2 ihjstrates a client-server system in 
which web user activity tracking and logging accord ng 
to the invention may be implemented. As shown, a client 
computer 202 and a web server 204 are coupled via a 
communication channel over the WWW network 206. 
Client 202 includes a processor 208 and associated 
memory 210 for, ag.. performing the operations associ- 
ated with the browser 102 and agent 104 according to 
the invention, as described herein. On the other hand, 
the server 204 includes a processor 212 and associated 
memory 21 4 for. ag.. performing the operations associ- 
ated with the logger 1 10 and log database 1 12 accord- 
ing to the invention, as described herein. The memories 
210 and 214 respectively store i n formation associated 
with operations of the processors. Particularly, memory 
210 serves to store the information pertaining to user 
activity tracked by the agent, while memory 214 serves 
to store the intonnation togged by the logger in the form 
of the log database. Further, ft is to be understood that 
the comrrunicafion channel 108 of FIG. 1, may be 
implemented as a TCP/IP connection, as is known in 
the art, over the WWW 206 in FIG. 2. As mentioned, 
whie a single server computer 204 is shown in FIG. 2 
for rrplemerrting the logger and log database, rt is to be 
understood that the logger and log database may be 
implemented on one or more separate computers. 
These computers rray conrnunicate with toe server 
204 via a TCP/IP connection over the WWW 206 or over 
cxxTimunication channels separate from the WWW 206. 
[0014] Referring now to FIG. 3, a ffow chart of a 
web user activity logging method 300 according to an 
exemplary embodiment of the present invention is 
shown. InrtiaJry, in step 302, a user requests a web (ag.. 
HTML) document from a server (ag., 106) using a web 
browser (ag., browser 102). The server returns the 
requested document with mobfle programming lan- 
guage code representing an agent embedded therein to 
the browser, in step 304. Then, in step 306, the agent 
(ag., agent 104) begins running on the users computer 
along with the browser and collects and stores informa- 
tion from the browser pertaining to the users activity, 
e.g., the user's interaction with the downloaded HTML 
document An exemplary errtxxfment of an agent data 
collecting method wil be explained below in the context 
of FIG. 4. Next in step 308, the agent determines 
whether it has data for fogging at the web server. This 
determination can be made based on various conrj- 
tions, e.g., col action of a certain amount of rribrmation, 
collection of a particular type of information, colection 
of data pertaining to any change in browser attrfoutes. 
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etc. If there is data for logging, the agent then transmits 
some or afl of the collected Information as fog informa- 
tion to the web server, in step 310. The transmission of 
the fog information from the agent to the web server 
may be periodic or based on the occurrence of some 
event If there is no data for logging, the agent merely 
continues to wait for data to be collected or probes for 
data changes, as will be explained. Lastfy, in step 312 at 
the web server, the logger (ag., logger 110) receives 
any log information transmitted by the agent and stores 
such data (ag., in the fog database 112) for future usa 
The particular format that the logger stores the ctata in Is 
not critical. However, the data is stored in association 
with (La, indexed to) the particular client from which It 
was received since the web server may likely be track- 
ing and storing client information from other agents 
established in other browsers. 
[0015] Referring now to FIG. 4, a flow chart of ah 
agent data collecting method 400 according to an exem- 
plary embodiment of toe present invention is shown. In 
step 402, the agent passively waits for notification from 
the browser or downloaded document of a change In an 
attribute or attributes associated with toe browser or 
downloaded document That is, attributes (ag., data 
fields, sequence of pages viewed, etc) associated with 
the browser ancVbr downloaded document change as a 
user interacts with the browser ancVbr downloaded doc- 
ument However, trie agent also sets a timer, in step 
404, when rt first begins to wait for change notification. 
The agent monitors whether this timer has expired, in 
step 406L If notification is received, the particular data is 
collected and stored for logging, in step 408. If the timer 
has expired and no data is received, the agent may then 
actively probe or search the browser or downloaded 
document for changed attributes, in step 410. Also, in 
step 410, the agent may setup modified or new change 
notification criteria and may set up information selection 
filers that screen the type of attribute changes that get 
reported to the agent For example, the agent may ena- 
ble a selection titer that provides for collection of only a 
particular type of data. That is, for instance, the agent 
may only collect and store Information indicating which 
pages of a retrieved document the user actually viewed, 
or information tracking the sequence of page to page 
navigation performed by the user. The agent can also 
reset the timer, in step 404. so that a fixed amount of 
time is allotted for waiting for information generated by 
the new or mocfified change notifications andvbr niter 
selections. The timer may also be reset and monitored 
for the active probing step. At the expiration of the timer, 
the agent again checks whether it has received data for 
logging (step 408). rf the agent has received data, ft may 
transmit the data to the logger. If not the agent may 
repeat step 410 to possibly setup modified or new crite- 
ria or to continue probing for changed data. Alterna- 
tively, the agent can return to the passive waiting mode 
with or without resetting the timer, ft is to be appreciated 
that the change notifications and selection filters can be 
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used to narrow the search performed by the agent in the 
probing step. 

[0016] Advantageously, the present invention sim- 
plifies the process of logging in CGI infrastructures 
since individual CGI programs do not have to generate s 
and/or maintain log information. Since the methodolo- 
gies of the invention are server and CGI-technique inde- 
pendent, the present invention provides a unified 
method for coflecting such information. It is to be appre- 
ciated that such an inventive logging facility as to 
descrbed herein may be easily added to existing web 
servers by those of orcfnary sWD in the art without any 
undue experimentation. As a result of the invention, 
avaJabimy of logs of client-side activity and its analysis 
can lead to better understanrJng of user behavior. Par- 75 
ties operating web sites can use this information to 
study how end users interact with toe information they 
provide. While the invention may find appScation in a 
variety of computing enviro nm ent s , the methodologies 
are particularly useful for web-based electronic com- 20 
merce (E-commerce) as wefi as web-based customer 
care. 

[0017] Although illustrative embodiments of the 
present invention have been described herein with ref- 
erence to the accompanying oYawmgs, it is to be under- 25 
stood that the invention is not limited to those precise 
embodiments, and that various other change and mod- 
ifications may be affected therein by one skated in the 
art without departing from the scope of trie invention. 

so 

Ctafrns 

1. A method for use in a server of tracking client activ- 
ity associated wrtti a browser running on at least 
one client device, the server and the at least one di- ss 
ent device being in a computer network, the method 
comprising the steps ofi 

transmitting executable code to the browser 
operable for collecting at least a portion of data 40 
associated with the browser-related client 
activity and for transmitting the data to the 
server; and 

logging the transmitted data associated with 45 
the browser-related client activity at the server. 

2. The method of Claim 1, wherein the executable 
code is transmitted from the server to the browser 
along with irrformation requested by the browser. so 

3. The method of Claim 2. wherein the computer net- 
work is the Internet and the requested information 
includes at teast one HTML document 

4. The method of Claim 2, wherein the coiected data 
is associated with browser-related client activity 
pertaining to the transmitted information. 



5. The method of Claim 1, wherein the executable 
code is in the form an applet associated with a 
mobile rxogramming language. 

6. The method of Claim 5, wherein the mobfle pro- 
gramming language is Java. 

7. A method for use in a client device of tracking client 
activity associated with a browser running on the 
client device in conjunction with a server, the client 
device and the server being in a computer network, 
the method comprising the steps of. 

collecting, in accordance with executable code 
received from the server in the browser, at least 
a portion of data associated with the browser- 
related client activity; and 
transmitting, in accordance wfth the executable 
code, the data associated with the browser- 
related cfient activity to the server for logging 
therein. 

a The method of Claim 7, wherein the executable 
code is received from the server along with informa- 
tion requested by the browser. 

a The method of Claim 8, wherein the computer net- 
work is the Internet and the requested in for m ation 
includes at least one HTML document 

10. The method of Claim 8. wherein the collected data 
is associated with browser-related client activity 
pertaining to the received information. 

11. The method of Claim 7, wherein the executable 
code is in the form an applet associated with a 
mobile pr o gramming language, 

12. The method of Claim 11, wherein the mobile pro- 
gramming lan gu age is Java. 

ia The method of Claim 7, wherein the coflecting step 
further comprises the step of passively waiting tor 
receipt of the data associated with the browser- 
related cfient activity. 

14. The method of Claim 13, wherein the collecting 
step further corrprises the step of setting and mon- 
itoring a timer such that the step of passrvely wait- 
ing for the data is performed tor a preoetermined 
amount of time. 

15. The method of Claim 7, wherein the coflecting step 
further comprises the step of actively probing for 
data associated with the browser-related cfient 
activity. 

16. The method of Claim 15. wherein the collecting 
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step furthef comprises the step of setting and mon- 
itoring a timer such that the step of actively probing 
for the data is performed for a predetermined 
amount of time. 

17. The method of Claim 7. wherein the collecting step 
further comprises the step of establishing notifica- 
tion criteria pertaining to the collection of the data 
associated with the browser-reteted cient actMty. 



w 



1a The method of Claim 17. wherein the col acting 
step further comprises the step of changing the 
notification criteria pertaining to to coflec&xi of tte 
data associated with the browser-related client 
activity. 15 

19. The method of Claim 7, wherein the collecting step 
further comprises the step of applying selection ti- 
ters pertaining to the collection of toe data associ- 
ated wim the browser-related diem activity. 20 

20. Apparatus for use in a server for tracking client 
activity associated with a browser running on at 
least one client device, the server and the at least 
one client device being in a computer network, the 25 
apparatus comprising means arranged to carry out 

a method as ctaimed in any of claims 1 to a 



21. An article of manufacture for use in a 
tracking cient activity associated with a browser so 
runnirig on at least one cient device, the server and 
the at least one client device being in a computer 
network, the article comprising a machine readable 
meoiurn containing one or more programs which 
when executed Implement the steps of a methodas as 
claimed in any of claims 1 to 6. 

22. Apparatus tor use in a cient device for trackiig ci- 
ent activity associated with a browser run^ 

the cfierrt device is conjunction with a server, the cS- 40 
ent device and the server being in a computer net- 
worK the apparatus comprising mearts arranged to 
carry out a method as claimed in any of claims 7 to 
19. 



23. An article of manufacture tor use in a client device 
for tracking client activity associated with a browser 
running on the client device to conjunction with a 
server, the client device and the server being in a 
computer network, the article comprising a so 
machine readable medium containing one or more 
programs which when executed irnplement the 
steps of a method as claimed in any of claims 7 to 
19. 
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